🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ SIMD Vectorization

AVX Instructions, Parallel Data Processing, Compiler Optimization, Performance

When AI optimizations miss the mark: A case study in array shape calculation
questdb.com·1d·
Discuss: Hacker News, r/programming
⚡Performance Mythology
Cracking the Density Code: Why MAF Flows Where KDE Stalls
towardsdatascience.com·13h
🔗Tailscale
Don't Repeat Yourself, Coarse-Grained Circuit Deduplication to Accelerate Sim
danglingpointers.substack.com·16h·
Discuss: Substack
🖥️Game Emulation
Deep Dive: OpenAI's GPT-OSS
dev.to·3h·
Discuss: DEV
📊Quantization
Parallel Reduce and Scan on the GPU
cachemiss.xyz·5d·
Discuss: Hacker News
⚡SIMD Optimization
numexpr: fast numerical array expression evaluator for Python
github.com·1d·
Discuss: Hacker News
🚀SIMD Parsing
Show HN: A short story on developing a long-context World-Model with no money
francesco215.github.io·12h·
Discuss: Hacker News
🧠Learned Codecs
Get Back To WARP
binary.ninja·13h
🧪Binary Fuzzing
Compute Where It Counts: a trainable LLM sparsity enabling 4x CPU speed
crystalai.org·2d·
Discuss: Hacker News
🌊Streaming Algorithms
Sysbench for MySQL 5.6 thru 9.4 on a small server
smalldatum.blogspot.com·1d·
Discuss: smalldatum.blogspot.com
🦀Rusty Databases
UnderColor’s spiral challenge from 1984 – part 3
subethasoftware.com·1d
📺VT100
Using large-scale search to discover fast GPU kernels in Rust
reddit.com·2d·
Discuss: r/rust
🦀Rust Macros
You could have invented CuTe hierarchical layout (but maybe not the rest of it?)
blog.ezyang.com·23h·
Discuss: blog.ezyang.com
⟷Bidirectional Programming
Cursor: 1.5x Faster Moe Training on Blackwell with MXFP8 Kernels
cursor.com·3d·
Discuss: Hacker News, Hacker News
🖥️Game Emulation
Speeding Up AI Coding Assistants Using Deterministic Feedback
proxymock.io·14h·
Discuss: Hacker News
📼Tape Combinators
FFmpeg 8.0 Released
ffmpeg.org·3h·
Discuss: Hacker News
🎬AV1 Encoding
Fast globally optimal Truncated Least Squares point cloud registration with fixed rotation axis
arxiv.org·1d
🌀Riemannian Computing
Writing Your First GPU Kernel in Python with Numba and CUDA
kdnuggets.com·4d
📊RISC-V Vectors
I tried DSPy and now I get why everyone won't shut up about it
pedramnavid.com·26m·
Discuss: Hacker News
🔄Burrows-Wheeler
Why your AI factory’s biggest bottleneck isn’t the GPU — it’s the network
blog.apnic.net·1d
🌊Stream Processing
Loading...Loading more...
AboutBlogChangelogRoadmap